Successive over-relaxation

In numerical linear algebra, the method of successive over-relaxation (SOR) is a variant of the Gauss–Seidel method for solving a linear system of equations, resulting in faster convergence. A similar method can be used for any slowly converging iterative process.

It was devised simultaneously by David M. Young, Jr. and by Stanley P. Frankel in 1950 for the purpose of automatically solving linear systems on digital computers. Over-relaxation methods had been used before the work of Young and Frankel, for instance the method of Lewis Fry Richardson and the methods developed by R. V. Southwell. However, these methods were designed for computation by human calculators, and they required some expertise to ensure convergence to the solution, which made them inapplicable for programming on digital computers. These aspects are discussed in the thesis of David M. Young, Jr.[1]

Formulation

Given a square system of n linear equations with unknown x:

A\mathbf x = \mathbf b

where:

A=\begin{bmatrix} a_{11} & a_{12} & \cdots & a_{1n} \\ a_{21} & a_{22} & \cdots & a_{2n} \\ \vdots & \vdots & \ddots & \vdots \\a_{n1} & a_{n2} & \cdots & a_{nn} \end{bmatrix}, \qquad  \mathbf{x} = \begin{bmatrix} x_{1} \\ x_2 \\ \vdots \\ x_n \end{bmatrix} , \qquad  \mathbf{b} = \begin{bmatrix} b_{1} \\ b_2 \\ \vdots \\ b_n \end{bmatrix}.

Then A can be decomposed into a diagonal component D, and strictly lower and upper triangular components L and U:

A = D + L + U,

where

D = \begin{bmatrix} a_{11} & 0 & \cdots & 0 \\ 0 & a_{22} & \cdots & 0 \\ \vdots & \vdots & \ddots & \vdots \\0 & 0 & \cdots & a_{nn} \end{bmatrix}, \quad L = \begin{bmatrix} 0 & 0 & \cdots & 0 \\ a_{21} & 0 & \cdots & 0 \\ \vdots & \vdots & \ddots & \vdots \\a_{n1} & a_{n2} & \cdots & 0 \end{bmatrix}, \quad U = \begin{bmatrix} 0 & a_{12} & \cdots & a_{1n} \\ 0 & 0 & \cdots & a_{2n} \\ \vdots & \vdots & \ddots & \vdots \\0 & 0 & \cdots & 0 \end{bmatrix}.
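
For example, the (illustrative) matrix

A = \begin{bmatrix} 4 & -1 & 0 \\ -1 & 4 & -1 \\ 0 & -1 & 4 \end{bmatrix}

decomposes into

D = \begin{bmatrix} 4 & 0 & 0 \\ 0 & 4 & 0 \\ 0 & 0 & 4 \end{bmatrix}, \quad L = \begin{bmatrix} 0 & 0 & 0 \\ -1 & 0 & 0 \\ 0 & -1 & 0 \end{bmatrix}, \quad U = \begin{bmatrix} 0 & -1 & 0 \\ 0 & 0 & -1 \\ 0 & 0 & 0 \end{bmatrix}.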

The system of linear equations may be rewritten as:

(D + \omega L) \mathbf{x} = \omega \mathbf{b} - [\omega U + (\omega - 1) D ] \mathbf{x}

for a constant ω > 1, called the relaxation factor.

The method of successive over-relaxation is an iterative technique that solves the left-hand side of this expression for x, using the previous value of x on the right-hand side. Analytically, this may be written as:

 \mathbf{x}^{(k+1)} = (D + \omega L)^{-1} \big(\omega \mathbf{b} - [\omega U + (\omega - 1) D ] \mathbf{x}^{(k)}\big).

However, by taking advantage of the triangular form of (D + ωL), the elements of x^{(k+1)} can be computed sequentially using forward substitution:

 x^{(k+1)}_i  = (1-\omega)x^{(k)}_i + \frac{\omega}{a_{ii}} \left(b_i - \sum_{j>i} a_{ij}x^{(k)}_j - \sum_{j<i} a_{ij}x^{(k+1)}_j \right),\quad i=1,2,\ldots,n.

The choice of relaxation factor is not necessarily easy, and depends upon the properties of the coefficient matrix. For symmetric, positive-definite matrices it can be proven that 0 < ω < 2 will lead to convergence, but we are generally interested in faster convergence rather than just convergence.

Algorithm

Inputs: A, b, ω
Output: \phi

Choose an initial guess \phi^{(0)} to the solution
repeat until convergence

for i from 1 until n do
\sigma \leftarrow 0
for j from 1 until i − 1 do
 \sigma \leftarrow \sigma + a_{ij} \phi_j^{(k+1)}
end (j-loop)
for j from i + 1 until n do
 \sigma \leftarrow \sigma + a_{ij} \phi_j^{(k)}
end (j-loop)
 \phi_i^{(k+1)} \leftarrow (1-\omega)\phi_i^{(k)} + \frac{\omega}{a_{ii}} (b_i - \sigma)
end (i-loop)
check if convergence is reached

end (repeat)

Note:
(1-\omega)\phi_i^{(k)} + \frac{\omega}{a_{ii}} (b_i - \sigma) can also be written as \phi_i^{(k)} + \omega \left( \frac{b_i - \sigma}{a_{ii}} - \phi_i^{(k)}\right), thus saving one multiplication in each iteration of the outer for-loop.
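
In Python, a direct transcription of this pseudocode might look as follows. This is an illustrative sketch, not a reference implementation: the use of NumPy, the residual-based stopping test, and the default tolerance and iteration cap are assumptions added here, not part of the pseudocode above.

import numpy as np

def sor(A, b, omega, x0=None, tol=1e-10, max_iter=10000):
    """Solve A x = b by successive over-relaxation (illustrative sketch)."""
    A = np.asarray(A, dtype=float)
    b = np.asarray(b, dtype=float)
    n = b.size
    x = np.zeros(n) if x0 is None else np.array(x0, dtype=float)
    for _ in range(max_iter):
        for i in range(n):
            # sigma accumulates the off-diagonal terms: the already updated
            # entries x[:i] (the k+1 values) and the old entries x[i+1:].
            sigma = A[i, :i] @ x[:i] + A[i, i+1:] @ x[i+1:]
            x[i] = (1 - omega) * x[i] + omega / A[i, i] * (b[i] - sigma)
        if np.linalg.norm(b - A @ x) < tol:  # convergence check (assumed criterion)
            break
    return x

For instance, sor([[4.0, -1.0], [-1.0, 4.0]], [3.0, 3.0], omega=1.2) returns approximately [1.0, 1.0]. The update line uses the first of the two algebraically equivalent forms given in the note above.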

Symmetric successive over-relaxation

The version for symmetric matrices A, in which

 U=L^T,\,

is referred to as symmetric successive over-relaxation (SSOR), in which

P=\left(\frac{D}{\omega}+L\right)\frac{\omega}{2-\omega}D^{-1}\left(\frac{D}{\omega}+U\right),

and the iterative method is

\mathbf{x}^{k+1}=\mathbf{x}^k-\gamma^k P^{-1}(A\mathbf{x}^k-\mathbf{b}),\ k \ge 0.
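
A minimal NumPy sketch of this SSOR iteration is shown below. It is illustrative only: the step sizes γ^k are fixed to 1, and P^{-1} is applied with a dense solve instead of the two triangular solves one would use in practice; both choices are assumptions made here for brevity.

import numpy as np

def ssor_iteration(A, b, omega, num_iters=100):
    """Illustrative SSOR sketch: x^{k+1} = x^k - P^{-1} (A x^k - b) with gamma^k = 1."""
    A = np.asarray(A, dtype=float)
    b = np.asarray(b, dtype=float)
    D = np.diag(np.diag(A))   # diagonal part of A
    L = np.tril(A, k=-1)      # strictly lower triangular part
    U = np.triu(A, k=1)       # strictly upper triangular part (equals L.T for symmetric A)
    # SSOR preconditioner P = (D/omega + L) * (omega / (2 - omega)) * D^{-1} * (D/omega + U)
    P = (omega / (2 - omega)) * (D / omega + L) @ np.linalg.inv(D) @ (D / omega + U)
    x = np.zeros_like(b)
    for _ in range(num_iters):
        x = x - np.linalg.solve(P, A @ x - b)  # gamma^k = 1 assumed
    return x

With A = [[4, -1, 0], [-1, 4, -1], [0, -1, 4]] and b = [3, 2, 3], ssor_iteration(A, b, omega=1.5) converges to the exact solution [1, 1, 1].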

The SOR and SSOR methods are credited to David M. Young, Jr.

Other applications of the method

A similar technique can be used for any iterative method. If the original iteration had the form

x_{n+1}=f(x_n)

then the modified version would use

x^\mathrm{SOR}_{n+1}=(1-\omega)x^{\mathrm{SOR}}_n+\omega f(x^\mathrm{SOR}_n)

or equivalently

x^\mathrm{SOR}_{n+1}=x^{\mathrm{SOR}}_n+\omega \left( f(x^\mathrm{SOR}_n) - x^{\mathrm{SOR}}_n \right).

Values of \omega>1 are used to speed up convergence of a slow-converging process, while values of \omega<1 are often used to help establish convergence of a diverging iterative process.
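
As a small illustration (the function f and the value ω = 0.5 are made-up choices, not taken from the text above), under-relaxation can turn a diverging fixed-point iteration into a converging one:

def relaxed_fixed_point(f, x0, omega, num_iters=50):
    """Apply the relaxed iteration x_{n+1} = (1 - omega) * x_n + omega * f(x_n)."""
    x = x0
    for _ in range(num_iters):
        x = (1 - omega) * x + omega * f(x)
    return x

# f(x) = -2x + 3 has the fixed point x = 1, but the plain iteration
# x_{n+1} = f(x_n) diverges because |f'| = 2 > 1.  With omega = 0.5 the
# relaxed iteration map has slope -0.5 and converges to the fixed point.
print(relaxed_fixed_point(lambda x: -2 * x + 3, x0=0.0, omega=0.5))  # ~1.0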

There are various methods that adaptively set the relaxation parameter \omega based on the observed behavior of the converging process. Usually they help to reach super-linear convergence for some problems but fail for others.
